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ABSTRACT 


Background: Cellulases are hydrolytic enzymes which hydrolyze P-1,4-glycosidic linkage in cellulose and these are present in 
many microorganisms including bacteria, fungi and protozoa. The three types of cellulases involved in complete hydrolysis of 
cellulose are endoglucanase, exoglucanase and P-glucosidase. Various structural and functional domains are present in cel¬ 
lulases and among all these domains, cellulose binding and catalytic domains are found to be important for the hydrolysis of cel¬ 
lulose. Cellulases have showed promising applications in different industrial sectors like paper and pulp, textile, laundry, bioetha¬ 
nol production, brewing, detergent and waste management. A major focus has been given in the recent past by researchers to 
understand the functional domains and catalytic mechanism of this enzyme to make their effective use for industrial applications. 

Material and Methods: The protein sequences of cellulases belonging to different Bacillus sp. were retrieved using Uniprot 
and then physicochemical properties were analyzed using ProtParam and Protscale. Multiple sequence alignment of retrieved 
sequences was performed using Clustal W and phylogenetic tree was constructed using Mega 6.0 software. SOPMAand GOR 
IV tools were used for the prediction of secondary structure. The tertiary structure of enzyme was computed using Raptor X. 

Results: The molecular weight of cellulases were found to range between 49,263-94,682 Da with hydropathicity ranges between 
-0.292 to -0.580. The acidic amino acid glutamate was found at the active site and methionone was found at the N-terminal of 
enzyme. The results have shown that the sequence is highly diversed at N-terminus and C-terminus region between different 
types of cellulases with conserved sequences in the middle. The phylogenetic tree has showed high similarity amongst retrieved 
sequences. From the tertiary structure, a great degree of variability in a-helix, extended strand in (3 ladder, hydrogen bonded 
turn, bend and coil was observed between different types of cellulases. 

Conclusion: This study provides insights about the physicochemical properties, hydrohobicity, structure and function of cellu¬ 
lases, which would help to exploit this enzyme at industrial level. 
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INTRODUCTION 

Cellulases are hydrolytic enzymes which can readily hydro¬ 
lyze both crystalline and paracrystalline structures of cellu¬ 
lose, the largest component of plant residues enters terrestrial 
ecosystems. Cellulose is present in various lignocellulosic 
wastes generated from agricultural and industrial processes 
like sawdust, citrus peel waste, paper mill sludge, industrial 
waste, paper pulp and municipal solid waste (Maki et ah, 
2009). The synergistic action of three types of cellulases i.e. 
endoglucanases (EC 3.2.1.4), exoglucanases (EC 3.2.1.74) 
and p-glucosidase (EC 3.2.1.21) have been involved for the 
complete hydrolysis of cellulose (Lugani et ah, 2015). Cellu¬ 
lases are industrially important enzymes and are involved in 
the conversion of lignocellulosic residues for the production 


of single cell protein, ethanol, bleaching of pulp, fruit juice 
extraction and for the treatment of waste papers (Shankar 
and Isaiarasu, 2011). Cellulases are produced by all the mi¬ 
croorganisms but mainly by bacteria, actinomycetes and fun¬ 
gi. Among all the microorganisms, members of bacteria have 
gained intense importance for commercial production of cel¬ 
lulases due to their high growth, wide genetic variability, 
adaptability and high amendability to genetic manipulations 
(Patagundi et ah, 2014; Lynd et ah, 2002). The structure of 
cellulase composed of carbohydrate binding domain (CBD) 
at C-terminal which is joined by a short poly- linker region to 
the N-terminal of catalytic domain. There is presence of two 
acidic amino acids at the active site of enzyme which catalyze 
the reaction by acid-base catalysis either through inversion 
or retention of the configuration of anomeric carbon (Maki 
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et al., 2009). The current focus of most of the researchers has 
been towards the large scale production of this industrially 
important enzyme to meet the industrial needs by utilizing 
various novel bacterial strains. However, a great degree of 
variability have been observed between different bacterial 
strains like molecular weight, stability, amino acid composi¬ 
tion, family and domain to which they belong, secondary and 
tertiary structure. Bioinformatics is an interdisciplinary field 
which is currently used for structural and functional analysis 
of proteins using various computations tools and databases 
(Prashant et al., 2010). The infonnation which has been re¬ 
trieved from available tools and databases about the protein 
might be useful for the selection of highly efficient bacterial 
strain for industrial production of enzyme. Moreover, this in¬ 
formation may also be helpful for developing new microbial 
strains with enhanced enzyme production ability by adapting 
recombinant DNA technology. Keeping in view the above 
facts about industrial importance of cellulases and use of 
bioinformatics as an emerging field of molecular biology, 
the present study was aimed to utilize in silico tools for the 
characterization of cellulase enzymes from different Bacillus 
species for their physicochemical characteristics, ancestral 
relationship and structure determination at various levels. 

MATERIAL AND METHODS 

Sequence retrieval and alignment 

The cellulase protein sequences from different species of 
Bacillus were retrieved from Uniprot (Universal Protein 
Resource). The retrieved sequences from Bacillus sp. were 
Bacillus subtilis, Accession number: PI0475; Bacillus 
akibai, Accession number: P06564; Bacillus thuringiensis, 
Accession number: M1QQC9; Bacillus pumilus. Accession 
number: B2ZHC9; Paenibacilluspolymyxa, Accession num¬ 
ber: E3EEC5; Paenibacillus macerans, Accession number: 
A0A090Y895. Clustal Omega (version 1.2.4) algorithm 
was used for the alignment of retrieved protein sequences 
through multiple sequence alignment. 

Physicochemical characterization 

ProtParam tool was used to compute different physicochemi¬ 
cal properties of retrieving sequences of cellulases like ami¬ 
no acid composition, aliphatic index (Al), pi, instability in¬ 
dex (II), number of positive and negative charged residues, 
grand average of hydropathicity (GRAVY) and extinction 
coefficient (Kumar et al., 2012). The isoelectric point (pi) is 
detennined based on the pK value of protein during protein 
migration under denaturation conditions (Bjellqvust et al., 
1993). The concentration of purified protein in the sample is 
evaluated from the value of extinction coefficient (Umang et 
al., 2012). The stability of protein is calculated from its insta¬ 
bility index (II) and the proteins are predicted as stable when 



their instability index is smaller than 40; however when the 
value of instability index is greater than 40, the protein is re¬ 
garded as unstable (Guruprasad et al., 1990). The volume oc¬ 
cupied by aliphatic amino acids side chain (alanine, valine, 
leucine and isoleucine) relative to total volume occupied is 
called aliphatic index and it determines the thermostability 
of a globular protein (Walker, 2005).The hydrophilicity or 
hydrophobicity of protein is determined by grand average of 
hydropathicity (GRAVY), which is the ratio of sum of hy¬ 
dropathy values of all the amino acids to total number of res¬ 
idues in the sequence (Umang et al., 2012). The hydropathy 
plots based on Kyte and Doolittle scale for all the retrieved 
sequences of cellulases were predicted using Protscale tool 
(Kumar et al., 2012; http://www.expasy.org/tools/). 

Phylogenetic analysis 

The ancestral relationship between retrieved protein se¬ 
quences of cellulase for different species of Bacillus was 
estimated by constructing the phylogenetic tree using Mega 
6.0 software (Gouripur et al., 2016). Neighbor joining (NJ) 
algorithm was used for distance tree building and bootstrap 
value was set at 1000. The bootstrap value denotes to genera¬ 
tion of new data sets with replacements. 

Secondary structure prediction 

Self optimized prediction method with alignment (SOPMA) 
and GOR IV tools were used for the analysis of secondary 
structure and results obtained from these tools were also 
compared to determine a- helix, P- sheet, turns and loops 
(Geourjon and Deleage, 1994; http://npsa-pbil.ibcp.fr/cgi-). 

Tertiary structure prediction 

The tertiary structure of proteins were constructed by using 
RaptorX structure prediction server, which provides high 
quality structural model by using the template of primary 
protein sequence (http://raptorx.uchicago.edu/StructurePre- 
diction/predict/). 

RESULTS 

Sequence retrieval and alignment 

The protein sequences of cellulase enzymes belonging to dif¬ 
ferent strains of Bacillus sp. were retrieved from Uniprot and 
these sequences were then characterized using Uniprot tool 
(Table 1). It has been analyzed from data that the molecular 
weight of enzyme lies between 54,681 to 94,682 Da and they 
belong to endoglucanase, P-glucosidase and exoglucanase, 
respectively. Different types of cellulases shows different 
catalytic mechanism due to variation in multienzyme com¬ 
plex formation. 

Thereafter, clustal omega software was used for multiple 
sequence alignment of these proteins (Fig. 1). The cellulase 
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sequences from species of Bacillus were found to be high¬ 
ly diverged at the N-terminal and C-terminal, respectively. 
However, conserved amino acid sequences with good simi¬ 
larity were found in the middle. The positions with absolute 
conservation are indicated with asterlink (*), whereas dots 
(.) represent the position of relative conservation. 

Physicochemical characterization 

The results of physicochemical properties like pi, number 
of positive and negative amino acids, extinction coefficient, 
instability index, aliphatic index, grand average of hydro- 
pathicity and total number of atoms for cellulase from dif¬ 
ferent different species of Bacillu .vare shown in Table 2. The 
isolectric point (pi) is the pH value at which mobility of pro¬ 
tein becomes zero with more compact and stable conforma¬ 
tion. The pi value of cellulase from Bacillus subtilis is more 
than 7, which means it contains more number of negatively 
charged amino acids. Whereas, for all the other species of 
Bacillus, the pi was found to be less than 7 and their cellu¬ 
lases were acidic in nature containing more number of posi¬ 
tive charge residues. Expasy’s Prot Parana can compute the 
extinction coefficient for a range of 276, 278, 279, 280 and 
282, however 280 nm is more preferred because proteins ab¬ 
sorb this wavelength more strongly with minimum interfer¬ 
ence from other substances. Cellulases from all the selected 
Bacillus sp. were found to be stable with instability index 
less than 40. The GRAVY value is negative for all the cel¬ 
lulase sequences and this has showed better possibilities of 
aqueous interactions. The total number of atoms in different 
cellulases ranging from 7717 to 13113.The comparison of 
amino acid composition (%) in different cellulase sequences 
was also carried out (Table 3) and different amino acids were 
found to be dominant in different sequences. Protscale tool 
was used for the construction of Kyte and Doolittle hydropa¬ 
thy plots (Fig.2) and transmembrane region of cellulase from 
different Bacillus sp. was found to be rich in hydrophobic 
amino acids as many points lie above the zero baseline. The 
minimum and maximum hydrophobic position and score for 
each cellulase sequence was also predicted (Table 4) with 
minimum and maximum score of -3.100 and3.489, respec¬ 
tively. 

Phylogenetic analysis 

Mega 6.0 software, which provides a subset of substitu¬ 
tion model and neighbor joining algorithm for distance tree 
building, was used for the construction of phylogenetic tree 
(Fig. 3). Cellulases from Bacillus subtilis P10475, Bacillus 
pumilus B2ZHC9, Paenibacilluspolymyxa E3EEC5, Paeni- 
bacillus macerans A0A090Y895 were found to be closely 
related compared to Bacillus akibai P06564 and Bacillus 
thuringiensis M1QQC9, which were diverged from many 
species of Bacillus. 


Secondary structure prediction 

The secondary structure of different cellulase sequences was 
estimated using GOR IV and SOPMA tools, the percentage 
of a-helix, extended strand and random coils incellulase from 
different Bacillus sp. are shown in Table 5. The presence 
of amino acid in the helix, strand or coil is depicted from 
the secondary structure (Ojeiru et ah, 2010) and secondary 
structure of cellulase was depicted by SOPMA (Pradeep et 
ah, 2012). From the results, it has been observed that alpha 
helix was dominant in Bacillus pumilus B2ZHC9 (46.52%), 
whereas extended strand and random coil was observed to be 
dominant in Bacillus subtilis PI 0475 (26.45%) and Paeniba¬ 
cillus polymyxa E3EEC5 (44.91%), respectively. 

Tertiary structure prediction 

The tertiary structures of different cellulase sequences were 
analyzed using RaptorX structure prediction server, which 
results in modelling of a protein in a step wise manner like 
template threading, alignment quality assessment and mul¬ 
tiple template threading. Different sequences of cellulases 
showed variability in a-helix, extended strand in p ladder, 
hydrogen bonded turn, bend and coil (Fig. 4). A high degree 
of variation was found in the geometric shape and interaction 
between side chains of amino acids, which lead to differ¬ 
ences in their lunctions. 


DISCUSSION 

The present study showed the industrial importance of cel¬ 
lulases in different sectors such as textile, laundry, bioetha¬ 
nol production, brewing, detergent, waste management, 
paper and pulp. There are different types of cellulases with 
different catalytic subunits which are involved in complete 
hydrolysis of cellulose. In silico studies are very promising 
tools in the current era for the characterization of industrially 
important enzymes for various properties for their selection 
for appropriate industrial application. The different types of 
cellulases from Bacillus sp. were studied for comparing their 
physicochemical characteristics, ancestral relationship and 
structure prediction at different levels. Various computation¬ 
al tools were used for the characterization of cellulases from 
different Bacillus sp. A great degree of diversity has been 
observed in molecular weight, family, domain, number of 
amino acids, positive and negative charged residues, second¬ 
ary and tertiary structure between the different fonns of cel¬ 
lulases. The phylogenetic analysis also showed the ancestral 
divergence of difference types of cellulases. This study will 
be helpful for the selection of industrially important bacte¬ 
rial strain with desirable characteristics for particular indus¬ 
trial processes. Moreover, this information can also be useful 
for designing new microbial strains by applying proteomics, 
system biology and microarray based strategies. Further, wet 
lab studies with reduced labour are required to design novel 
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cellulase producing bacterial strains by using in silico data 
output. 

CONCLUSIONS 

The present study concludes that cellulases are industrially 
important enzymes due to their ability to utilize agricultural 
wastes for the production of industrially important products. 
In recent years, various researchers have adapted genetic en¬ 
gineering approach to develop novel microbial strains with 
enhanced enzyme producing ability. However, such pro¬ 
cesses are very tedious and tune consuming, which is the 
major obstacle for adapting such processes for commercial 
enzyme production. Such limitations can be overcome by 
initial screening through in silico studies to understand the 
structure, function and physicochemical properties of the en¬ 
zyme. These computational studies would be promising tool 
to design enzymes with desirable characteristics for exploit¬ 
ing them at industrial level. 
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Table 1: Characterization of retrieved sequences of cellulases for different Bacillus sp. using Uniprot tool. 


s. 

No. 

Organism 

Accession 

number 

Protein 

Number 
of amino 
acids 

Molecular 
weight 
(in Da) 

Molecular 

function 

Family 

Domain 

1 . 

Bacillus subtilis 

P 10475 

Endoglucanase 

499 

55.287 

CA 

gh 5 

CBM 3 







CB 



2 . 

Bacillus akibai 

P 06564 

Endoglucanase 

800 

88,602 

CA 

gh 5 

CBM 17 







CB 


CBM 28 

3- 

Bacillus thuring- 

M 1 QQC 9 

(3-glucosidase 

469 

54,681 

BGA 

GHi 

GHC 


iensis 








4- 

Bacillus pumilus 

B 2 ZHC 9 

(3-glucosidase 

488 

56,352 

HA 

GHi 

GHC 

5- 

Paenibacillus 

E 3 EEC 5 

Exoglucanase 

717 

77,018 

CA 

GH 6 

CBM 6 


polymyxa 





HA 



6 . 

Paenibacillus 

A 0 A 090 Y 895 

Exoglucanase 

889 

94,682 

CB 

GH 6 

CBM 3 


macerans 





HA 




where, GH- Glycoside Hydrolase, CBM- Carbohydrate Binding Module, GHC- Glycoside Hydrolase Catalytic, CA- 
Cellulase activity, CB- Cellulose binding, BGA- P-galactosidase activity, HA- Hydrolase activity 


Table 2: Cellulases parameters computed using Expasy’s Prot Param tool. 


S. 

No. 

Organism 

Accession 

number 

Pi 

+ R 

-R 

EC (M 1 
cm 1 ) 

II 

Stability 

AI GRAVY 

Formula 

TNA 

1 . 

Bacillus 

subtilis 

P 10475 

8.55 

59 

55 

106925 

26.16 

Stable 

73.91 - 0.580 

C H N O S 

2459 3815 673 760 10 

7717 

2 . 

Bacillus 

akibai 

Bacillus 

P 06564 

4-3 

59 

133 

147250 

34.10 

Stable 

80.21 - 0.456 

C s H r N O , S 

3938 6000 1040 1263 15 

12256 

3- 

thuringien- 

sis 

M 1 QQC 9 

5-74 

57 

69 

127785 

30.86 

Stable 

73-37 "0-574 

C H N, O S 

2510 3729 639 715 12 

7605 

4- 

Bacillus 

pumilus 

Paeni- 

B 2 ZHC 9 

4-99 

56 

78 

91930 

36.44 

Stable 

76.39 - 0.506 

C H N O, S 

2543 3856 650 761 20 

7830 

5- 

bacillus 

polymyxa 

Paenibacil- 

E 3 EEC 5 

5-5i 

65 

76 

117480 

23-73 

Stable 

70.95 -0.420 

C H N O S 

3422 5276 920 1079 *4 

10711 

6 . 

lus macer- 

A 0 A 090 Y 895 

4.67 

64 

98 

156330 

23.15 

Stable 

73.35 - 0.292 

C H N O S, 

41 QQ 6431 111 3 1 3 54 16 

13113 


arts 












where pi: Isoelectric point, +R: number of positive charged residues (Arg+ Lys), -R: number of negative charged 
residues (Asp+ Glu), EC: Extinction coefficient at 280 nm, II: Instability index, Al: Aliphatic index, GRAVY: Grand 
average of hydropathicity, TNA: Total number of atoms 


Table 3 : 

Amino acid composition (%) of cellulases computed using Expasy’s ProtParam tool. 

S. No. 

Amino acid 

P10475 

P06564 

M1QQC9 

B2ZHC9 

E3EEC5 

A0A090Y895 

1 . 

Ala (A) 

7.0 

8.4 

5-5 

7-4 

10.0 

11.9 

2 . 

Arg (R) 

2.8 

3-i 

3.6 

3-9 

2-5 

2-5 

3- 

Asn (N) 

7.6 

8.4 

5-i 

4-3 

6.8 

6.3 

4- 

Asp (D) 

7-2 

7-4 

6.2 

7.8 

7-5 

7-i 

5- 

Cys (C) 

0.6 

0.0 

0.6 

1.0 

0.6 

0.4 

6 . 

Gin (Q) 

4.2 

2.2 

2-3 

3-9 

2.6 

2.8 
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7 - 

Glu (E) 

3.8 

9.2 

8.5 

8.2 

3 -i 

3-9 

8. 

Gly(G) 

9.0 

7-2 

8.1 

6.4 

8.6 

8.2 

9 - 

His (H) 

i -4 

1.6 

3-2 

2.0 

1.4 

1.0 

IO. 

lie (I) 

6.4 

5-9 

6.2 

6.6 

4.6 

4.0 

11. 

Leu (L) 

7.0 

6.9 

6.6 

7.8 

6.6 

5-5 

12. 

Lys (K) 

9.0 

4.2 

8.5 

7.6 

6.6 

4-7 

13- 

Met (M) 

i -4 

1.9 

1.9 

3 1 

1.4 

i -3 

14- 

Phe (F) 

3.0 

3-4 

4-9 

4-7 

3-3 

2.8 

15- 

Pro (P) 

3-4 

6.1 

4 -i 

3-7 

6.7 

6.1 

l6. 

Ser(S) 

7-4 

5-5 

3-4 

4-7 

7 -i 

8.0 

17- 

Thr (T) 

6.8 

5-2 

4-7 

4-3 

8.8 

8.3 

18. 

Trp (W) 

2.8 

2-5 

3.0 

1.6 

2.0 

1.9 

19- 

Tyr (Y) 

4.0 

3 -i 

7-2 

6.6 

3.8 

4-7 

20. 

Val (V) 

5.0 

7.6 

6.2 

4-5 

6.0 

8.3 


Table 4: 

Hydrophobic score and position of cellulases using Protscale tool. 



S. No. 

Organism 

Accession number 

Position 

Min Max 

Score 

Min 

Max 

1 . 

Bacillus subtilis 

P10475 

143 

13 

-3.044 

2.811 

2. 

Bacillus akibai 

P06564 

794,795 

1718 

-3.044 

3.489 

3 - 

Bacillus thuringiensis 

M1QQC9 

57 

169 

-3.100 

1.878 

4 - 

Bacillus pumilus 

B2ZHC9 

68 

320 

-2.889 

1.922 

5 - 

Paenibacillus polymyxa 

E3EEC5 

366 

21 

-2.800 

2.044 

6. 

Paenibacillus macerans 

A0A090Y895 

857 

16,17 

-2.400 

1.889 


Table 5: Prediction of secondary structure of xylose reductases using Expasy’s GORIV and SOPMA tool. 


S. No. 

Organism 

Accession 

GOR IV analysis 


SOPMA Prediction 




number 

a-helix 
(Hh) (%) 

Extended 
strand (Ee) 
(%) 

Random 
coils (%) 

a-helix 
(Hh) (%) 

Extended 
strand 
(Ee) (%) 

Random 
coils (%) 

1 . 

Bacillus subtilis 

P10475 

18.24 

28.66 

53.11 

22.85 

26.45 

36.47 

2. 

Bacillus akibai 

P06564 

28.38 

17.25 

54-37 

27.88 

23-75 

38.25 

3 - 

Bacillus thuringiensis 

M1QQC9 

29.64 

23.88 

46.48 

35.18 

22.39 

30.06 

4 - 

Bacillus pumilus 

B2ZHC9 

39-34 

18.24 

42.42 

46.52 

15.16 

28.28 

5 - 

Paenibacillus polymyxa 

E3EEC5 

19.80 

20.36 

59-83 

22.32 

22.04 

44.91 

6. 

Paenibacillus polymyxa 

A0A090Y895 

19.69 

23.51 

56.81 

25.08 

21.82 

41.96 



Int J Cur Res Rev | Vol 9 • Issue 1 3 • July 201 7 













Lugani et.al In si/ico Characterization of Cellulases from Genus Bacillus 


V nQQOlOQQC! MTU .. tr lOQCf IWQQO.WCT. 

tr 122X9 122K9>* .~. tr|U2XSK2K9J-£*. 

tr EXECS EXECS.MCK * f ffg 

tr W^f Ym| W 9T19S MW .OQVTIXWAI^"#lSi(ri^»ISUaW^SlffH»Da.i^f®T !f M4fX*»t>_MEM 

% imtvoaj ucsu ” ... if aju i*Hu 

if WMlaM.IIU) ..... 1f|Mtf*4 8A MOi 


.40ufiar*i<T\«;rt.tpmwK 

.I^JCHItllOKEgKQJ.-Ul • -a*** 

»rouk*>CTT>Trrt^Tv«rs<^ . 

vtrym-.'Gn>yt<v**JQC^ •**** 

imasirtai. 

WUBMH>.-... 


tr 100(9 PWf.MTU 
tr 122X9 3230.MCPJ 
tr EXECS itXKS.MCrS 
tr *WW^W51 *W>M9)_WW 
if imtsioiu ucsu 
if mum oa lAOi 

tr 1QQ(9 «0QCt MC\ 
tr W2K9jl2ZX9.MC*i 
tr EXKSlEXKS.MtfS 

tr 1*1**'!*! fUMMts.nem 

If 1#TS O1C ucsu 
If M6W4 0A.MO3 

tr *100(9 H50QCI.MTU 
tr 121X9 922K9_MC*. 
tr EXKVHffCt MftS 

tr iaM9rrt9*|iMwri?s Mm 

if IMtt OA2.IXSU 
if mbu oiii.Ma3 

tr 10QC9 MQQC9 MOV 
tr 122X9 K2K) ^ 
tr EXECS EXCCS.MCPS 

tr >ft!#9frW 1 imw»19S WW 

If IMniOIC.BXW 
if MfcSM GUIUCAJ 

tr 10QO | MQQC9MCTU 
tr 122X9.122K9MCR; 
tr EXECS EXCCS.MtrS 
tr IM^IK |*WW , W5 > HW 
If UMTS OIC.IACSU 
if M*SM OA MUi 

tr 10Q(9JQOQ(«.MCTU 
tr 122X9|I22X9_1KJ»U 
tr EXKSI EXE(5~MIPS 
tr M»5erJ95|*M«M9>.M£M 
If UMTS OA2 UCSU 
If M65C4 0A.MO3 

tr *10QC9 IOQQC9.MTU 
tr I22X9II22X9 MCPu 
tr EKKSltKKS.MtfS 

tr 4 ai#9rr9*|4il *» 19s Mm 

If 1**5 04C.UCSU 
If M6SM 011.1403 

tr *100(9 MQQC9.MCTU 
tr 122X9 I22KJ_MOU 
tr EXKSlftffCS.Mm 
tr 1 ku *W 'vr> Mm 

if wrsoie.Mau 
if *6*64 0M.S4O3 

tr *100(9 PO0QC9.MCU 
tr 122X9 l22K9*MOv; 
tr EXKSISXECS.Mm 
tr M14|4f9T ffr Mm 
If PltWlOIO.IKV) 
if mbu 0«_a4O3 


rrxrftwtiooMASX&nTQiacTmi^^ 

r:*4S^nmico«os«inTgD4i£Tnu»xvi«if9#i>to:4i^ 


■HUKdlOllS-S!.41— 


tr|*100(9 "W9MT- 
tr1122X1 122X9 %U.K 
trIEKKS IXICslwm 
\r *Mm6i«m 

If IMM7S 0M.I4CSU 
If M6SC4 0A_MO3 


iiri^i»vfr»txiicvf^iri'iiwsHWv9^. 

ISr#s»MO.itou—• 4001091.— 


19MMKKSVM*.MDS^IWTliOS^TyTriMXSTV^W 

vHMiMOC.SSS4UK4-.StfOGAtSD 

4H61TWX.V564nXfl0C9A*.*k0K*>^UW 


tt40SJX¥mirTroEEQ4Cta:*btf^Tiifiuii9i^^ 

i [ ••% jv vtx s^kjumssa: 'Xk: TVAja*wwrx: c*** cx* 


L^iistM’TXAXomtnri.. -hlgxm 


MU4TVS1 IT ’ IICrSttA4S«0» 2 TNMMTSX TfiXP0MlN!ttQ4KSSlCtn 
M QMMI SOT* VKT4MT r* ft* PWftl QH6gt>U4*CYtt 

.61S- • SXlO 

vmpka-. -jLkxmr* v.-opacioucw- • txlq 


.ASj^fG»«aorn*ar$«:>oi • -iinxzwxHxtY 

.4£lTWI0P®(—.fOMIMflBWMf 

HmW5y4£5lWVUO«S(&“—-HCTQNiffMMTS—4¥ 

n^«A»TM(vs(#¥ica^sc:b-.ft ::‘*cttfl0MI. as 

If g nM Pk M—“IIBCMBT*.44A"14£W»IIIAS‘.V* 

#?tIJCMXA . \jec*9*l . 4UrrxW46Xf .. ll 

UttkNmmm-iSJMnmowm- • -Ecurow^c^rvcitn 

.#*—oi 

GSC2¥m&SGIUWti>flBX^ .4PQAY-• 4H 

«Mr^C£AtA4«lGWinMa JC* • 4*X?**EU40F-• xmSlYWTM 
tvrr 1IU CUI KO^IVOMMIM-• 4CTX^40M)4I • 

Vh*MXhfcf.•ca^Xlt|M4W4H(5UKM->» , *ifc!Ttli 

VT^WWITl!...MVNfaPS*!* .•ftr’IJUmtVfflC-'WMrTHi 

io»mwK^sk:xcwsH)«xDrTr , o<ri T >»Jwisohm> 

«*IM«9®5DGSSU:*CWOT 

VMIMXMVCt.-Oil.M£* 

i:*IJXIS9M6MaWXE«aM[.£>4D« 

tmosis* .u&i -mcvto*. . 

(•WXNKV- 1 W 1 lRX 9 M[IQ 4 «- 

. 

OlrtL WrMJfl SOfcOl-KITMMWr L ;AA7llft6Mt-S4KS>tMJUlSAi«F 

hTSWi.4.VCWIWC^OXVIU^- . 

fftWJ.460—"^ WfiDC!X5M«0"^A» . 

.vfgrwur.-4i1V«r(«0U5»G£2£JlMLM^ 

. »ATva4«::p»f^2GCXx:wY 

.m m 

M.^mto.rYTi2tsasnTYYvws»x4a5«sw 

.. 

.vno4mwwYT09U6T.iyr.f 

Sl0CO3CtUMHVCltr4Y)9:LICG^A\. 

. 

rso-.uAtvw^nv. .onsiMosnv* • • •a/rsoMf TMP.m 

m .—iuxjM 6 M*#u#nrt 6 

.--WUWV---4lSA44r[^H&TSD4VJIi6*l.aO«r^-- 

x? . mktom* m n «ii«M^ATMiso4xmGrriouoy^ 


tr 1100(5 moQC9_IX\ 
tr 122X9 122X9 IKFJ 
tr fUKI HHCS MIPS 
tr|4Mf9f*99S 
IfinMTS OM.MCV 
if M6S64 0*_MO3 

VI100C5 PJ00C9.MT. 
tr !22Xi »2^<9.MCP. 
trinaa Diccs.MiPs 

tr | uwsrrw ] Aiiiflww »«m 

IfllMTS OJK2 AKSU 
ifIM6544 0H.I4O3 

tr lOQCi KtOCO.WT. 
tr 1122X5 522K) W. 
trlEXICS E3([CS]W5 
tr »i!f5<vJ95 4f4f9mS_M£lU 
if KUK OJU I4CSU 
tf|M6««4 GUI B403 

tr | *300(5 WQOC 9 IX r. 
tr1122X9 I22X9.MCP. 
trlEXKS EXCCS.WEPS 
tr ttXMWS XM 

If ’1M75 OK.S4CSU 
if HMA 0JI.I4O3 

tr| 1 Q 0 C» miOOC9JXt- 
trj«22X9 I22X9.IXP. 
tr tJeKS eXSCS.MIPS 
tr| itmrtw \mmmjtoik 
tf|’lM75 QK.IXSV 
If M6S64 (AM bKXl 

tr 1*100(5 MQQC9.MCTH 
tr| 122X 5 122X 9 IKK 

trIEXKS CXECS.MffS 

tr IM5MK *W5t*W5_HW 

ifliMTs an ucsu 

lf|M4S44 UA 1403 

tr|100(5 »W9 MC. 

tr 1122X9 122X9 U&. 
tr [MK5 EMECS.N0S 
V|4Mf5W5 444f54*l»3_M£JU 
Sfl’lMtt QM_IXSU 
If M4SM OA MO) 

tr|100(5 MQQC9 Mcr. 
tr j 122X9 122X9 IXP. 
tr|EXK5 EXIC5.WIPS 

tr 4 B«*v B c M4f5e*Y9S MEM 
If >1M7S OA2.I4CSU 


.«XaTYTWMlSP€ffl£ 0 muVM 2 : 0 »^"l^ 

.•IIHh50(lflVKU!SliU<Gn0l4XHa 

.pfirtnaiMir' • • . 

MKSttSS . -CSMVS44flELnoGDUi4pY—4NGDSSA- 

isfscTT)APOLC^Tcr>nPs<i>p*w:sw 9 i. icxsa. 

iscsa AU^Gmp:D9*rrrrjr:r«c-iO(7c----v»^. 

«<CUTfC*UXC«-IIir... JL wpx- • •E<MCE4llW6YTMaE)t ISO 

.•MNW«W*VlAWTP 9 VSflcr>try»Titt^..^ > MIA^»X 

.IWQBPQWHPftGSTfrt ,SD* ,>rrV5 T IHgO»»>Q>MIAyfeS 

.UYlOflTO VS'Utt*r*M AGC • -IfOCD^IGK 

.fNViEXIOiaSG OX9fK^WfMIM4^AC QOCSYCICM 


.-.4 

TSp- .4 

AMT1.*OfH:7S444CTLMC— 

VWITf.T 0 SVX^TSOCSU«— 

lATXFVnxM.4«0-t--n-EL 0 «XTl 4 fG— 


U TXiaV-XPTTVSUAIPQmAAMmtAint^lllfVi KKKAK TI’SJCS 

«5W> UVXO- -XIAWCLS.3-NMAIMWK.— 

ify^lYVOTMCT jfrxtDS.»-nraouTKcOLS. 

irooiotwiecifiAJ^XJvsEy^ . 

G0T®lQtl4ACSS«2rXCfTTOftri*T0Ty<ii€(VT;V04CEl\4C:EWl* • — 

4s^3aqtiL»icc%6>MA5s<3C)rsrrcsii-^cTTtKrTL<)gccuxcrim-. 

muiMManmi.urv6»—mxMTnDCiviGTEvc]p/>€> 


la* WW^SG.XT Al * : • UX 3 McS-? ^ 7 S> 4 ^U 


m >mvx MUWYTionopMATiOAim **^r • ■w>w>« m ixo 


nfFA^v. v»r*mw«:Tiiigooni»ffci«aftfSi>«Nf4iAt<i<iA 


Figure 1: Multiple Sequence Alignment result of translated Cellulases Contig showing differences in 5’ and 3’ UTR. 
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Figure 2: Kyte and Doolittle Plots for cellulases from different microorganisms (A: Bacillus suM//sP10475; B: Bacillus ak- 
/tba/P06564; C: Bacillus thuringiensis M1QQC9; D: Bacillus pumilusB2ZHC9\ E: Paenibacillus polymyxaE3EEC5\ F: Paenibacil- 
lus maceransA0A090Y895). 


-Bacillus subtilis/P10475 

-Paenibacillus macerans/ A0A090Y895 

-Paenibacillus polymyxa/ E3EEC5 

-Bacillus pumilus/ B2ZHC9 

Bacillus akibai/ P06564 
-Bacillus thuringiensis/ M1QQC9 


I-1 

0.2 

Figure 3: Phylogenetic tree constructed by Mega 6.0 software (Molecular Evolutionary Genetics Analyses) with Neighbor Join¬ 
ing method showing evolutionary relationship among xylose reductase sequences from different origin. Bootstrap values are 
depicted at the nodes with Bar value of 0.2. 
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